Comparing a Coevolutionary Genetic Algorithm for 
Multiobjective Optimization 

Jason D. Lohn 1 , William F. Kraus 2 , Gary L. Haith 3 
Computational Sciences Division, MS.269-1 
2 QSS Group, Inc., MS 269-3 

1,2 NASA Ames Research Center, Moffett Field, CA, 94035 USA 
3 Narex Dorn Golden, GO, US A 

{ j lohn , bkraus}@email . arc . nasa . gov haithOstanf ordalumni . org 


Abstract — We present results from a study compar- 
ing a recently developed coevolutionary genetic algorithm 
(CGA) against a set of evolutionary algorithms using a suite 
of multiobjective optimization benchmarks. The CGA 
embodies competitive coevolution and employs a simple, 
straightforward target population representation and fit- 
ness calculation based on developmental theory of learn- 
ing. Because of these properties, setting up the additional 
population is trivial making implementation no more diffi- 
cult than using a standard GA. Empirical results using a 
suite of two-objective test functions indicate that this CGA 
performs well at finding solutions on convex, nonconvex, 
discrete, and deceptive Pareto-optimal fronts, while giv- 
ing respectable results on a nonuniform optimization. On 
a multimodal Pareto front, the CGA finds a solution that 
dominates solutions produced by eight other algorithms, 
yet the CGA has poor coverage across the Pareto front. 

I. Introduction 

Cooperation and competition between populations of 
organisms in nature has inspired researchers to incor- 
porate coevolutionary dynamics into genetic algorithms. 
The common element in these approaches is the inclusion 
of one or more additional populations. A growing body 
of research explores coevolutionary approaches that capi- 
talize on this dynamic quality (for review, see [12]). This 
coevolutionary work has largely concentrated on competi- 
tive interactions. The interactions can be between individ- 
uals that compete in a symmetric game-like context [13], 
[15], or between populations of different types of individ- 
uals that compete in predator/prey type relationships [5], 
[10], [9], [6], [14]. In these cases, individuals are rewarded 
if they defeat the individuals with which they compete. 
These interactions can support “arms-races” in which the 
individuals force each other to become increasingly com- 
petent. 

A few studies have investigated the role of cooperation 
and how it can help solve some problems endemic to evo- 
lutionary methods, like the difficulty of choosing an appro- 
priate encoding for the individuals [11] and the difficulty of 
decomposing composite problems [2]. Other studies have 
found that a balance of cooperation and competition is 
necessary to prevent evolutionary algorithms from getting 
trapped in local minima, or “Mediocre Stable States” [3]. 

In this paper we describe a coevolutionary genetic al- 
gorithm (CGA) whose fitness calculations are inspired by 


developmental theory. The fundamental idea is to use co- 
evolutionary dynamics to automatically regulate the level 
of difficulty, from easy to hard, posed by a population of 
tests. We then describe multiobjective optimization prob- 
lems and a suite of test functions that we use to judge the 
performance of the CGA. Empirical results from the CGA 
runs are presented and compared to previously-published 
results, 

II. Coevolutionary GA 

The coevolutionary algorithm w r e present is based on 
an algorithm used in previous evolvable hardware appli- 
cations [7], [8], and is based on competition between two 
populations. The population of candidate solutions, or 
trial population, is represented and manipulated much the 
same as the main population in a standard genetic algo- 
rithm. The second population, or target population, con- 
sists of target objective vectors (TO Vs) - vectors contain- 
ing targets for the individual objectives to be optimized. 
An overview of the algorithm is presented in Figure 1. 

The population of TOVs is used to encapsulate the level 
of difficulty that the trial population faces. Under the con- 
trol of the genetic algorithm, the TOVs evolve from easy 
to difficult based on the level of proficiency of the trial 
population. The algorithm designer need only specify two 
TOVs: an easy TOV and a difficult TOV, the latter being 
the ultimate goal of the run (analogous to stopping criteria 
in standard evolutionary algorithms). The CGA seeds the 
easy TOVs into early generations of the run to guarantee 
that the coevolutionary dynamic will be used - as we shall 
see, if all TOVs were too difficult for generation zero in- 
dividuals, there would be no competitive mechanism and 
hence no fitness feedback between populations. 

Each TOV consists of a set of target objectives that act 
like thresholds: all thresholds must be met or exceeded in 
order for the TOV to be “solved,” and hence gain fitness 1 . 
The general form of the fitness calculations are as follows. 
Trial individuals are rewarded for solving difficult TOVs. 
The most difficult TOV at a given generation is defined 

1 This all-or-nothing property can be relaxed to accommodate par- 
tial solutions, however that version of the algorithm will be reported 
on in future work. 
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Fig. 2. Example of Pareto front when minimizing two objectives 
/i and /2. Nondominated solutions are represented as hollow 
circles (o) and dominated solutions by filled circles (•). 


to be the one that only one trial individual can solve. 
Such a TOV garners the highest fitness score. TOVs that 
are unsolvable, or are very easy to solve by the current 
trial population, are given low fitness scores. Fitness of 
individual in the trial population is computed as follows. 
Individual i “plays” each TOV in the second population 
and a score, s*, is computed: 

_1 

# trial 
jetoVi individuals 
that solve to vj 

where to\q is the set of TOV indexes such that individual 
i solves tovj. Note that the denominator in the above 
fraction is guaranteed to be greater than or equal to one 
due to the restriction on j. Then s* is normalized linearly 
between its upper and lower bounds such that 0.0 is the 
best score and 1.0 the worst: 

F (trial individual*) = 1.0 — Si/M 2 

where M 2 is the size of the TOV population. The effect 
of s is to reward trial individuals that solve the more diffi- 
cult TOVs. A TOV has the greatest difficulty level when 
exactly one trial individual can solve it. If many trial 
individuals can solve a particular TOV, the fitness contri- 
bution in s is shared among the trial individuals [14]. 

Fitness of an individual TOV is computed as follows. 
Let Xj denote the number of trial individuals that solve 
tovj, and M\ be the trial population size. The fitness is 
essentially Xj, scaled and normalized, with a tractability 
constraint: 

1.0 Xj = 0 

~ 1°) X i - 1 

The tractability constraint gives a target vector a score 
of 1.0 (the “worst” score) when no trial individuals can 


solve it. This puts pressure on the TOV population to pose 
difficult, yet solvable problems to the trial population. 

Ill, Multiobjective Optimization 

The notion of weighing tradeoffs is common to prob- 
lems in everyday life, science, and engineering. Buying 
a less expensive product might tradeoff product quality 
for the ability to buy more of something else. Adding 
an additional science instrument to a spacecraft trades 
off increased costs for increased science return. Hard opti- 
mization problems typically require many decisions on the 
input side and many objectives to optimize on the output 
side. The set of objectives forms a space where points 
in the space represent individual solutions. The goal of 
course is to find the best or optimal solutions to the opti- 
mization problem at hand. Pareto optimality defines how 
to determine the set of optimal solutions. A solution is 
Pareto-optimal if no other solution can improve one ob- 
jective function without a simultaneous deterioration of at 
least one of the other objectives. A set of such solutions is 
called the Pareto-optimal front. An example of a Pareto 
front is seen in Figure 2. 

Evolutionary algorithms (EAs) have recently attracted 
much attention in the exploration of Pareto-optimal 
fronts. It is claimed that EAs are the preeminent search 
algorithms for such tasks [16]. 

Below we briefly touch on relevant terminology and def- 
initions regarding multiobjective optimization problems 
(following [16])- The set of input parameters, or decision 
variable, is called the decision vector. The set of objective 
functions that measure the performance of the system is 
called the objective vector . In an evolutionary algorithm 
framework, a decision vector naturally corresponds to a 
candidate solution, and the functions comprising the ob- 
jective vector are typically incorporated, by various tech- 
niques, into the fitness function(s). 

A dominance test is a way to measure the relative per- 
formance among decision vectors. Given two decision vec- 



F(tovj ) = 




tors a and b, a dominates b if and only if a ties or ex- 
ceeds b’s performance on every objective, and there exists 
at least one objective where a’s performance strictly ex- 
ceeds b’s. Using this test, we can pare down any given set 
of decision vectors and find the the set of nondominated 
decision vectors. Such a set is said to form the nondomi- 
nated front. If the nondominated set resulted from testing 
every possible decision vector, then the nondominated set 
is the Pareto-optimal front. 

A coverage test adds a test for equality to the dominance 
test. Given two decision vectors a and b, a covers b if and 
only if a dominates b or a’s objective vector is identical to 
b’s. The coverage test is used to compare two algorithms 
as follows. The function C(A, B) computes the percentage 
of algorithm B’s solutions that are covered by solutions 
produced by A. 

The above tests (see [16] for formal definitions) are used 
assess the ability of algorithms to optimize a set of de- 
cision vectors. The dominance test will be used to cull 
dominated solutions produced by a given algorithm. The 
coverage test will be used to compare the solutions pro- 
duced by algorithms head-to-head. 

IV. Experimental Setup 

We follow the suite of multiobjective test functions 
and empirical results presented in [16]. Briefly, there 
were seven multiobjective evolutionary algorithms and 
one random search algorithm executed on six test func- 
tions. The algorithms compared in [16] were: random 
search (RAND), Fonseca and Fleming’s multiobjective 
GA (FFGA), the Niched Pareto GA (NPGA), Hajela 
and Lin’s weighted sum approach (HLGA), the Vector 
Evaluated GA (VEGA), the Nondominated Sorting GA 
(NSGA), a single-objective EA using weighted-sum ag- 
gregation (SOEA), and the Strength Pareto GA (SPEA). 
The CGA described above is denoted COEV. 

The test functions, 71-76, were chosen because they 
provide a range of difficulties for multiobjective optimiza- 
tion (e.g. } multimodality, deception, isolated optima). In 
each optimization, it is desired to minimize the objective 
vector by find its Pareto-optimal front. 

To allow a direct comparison to the results in [16], we 
followed the run setup as closely as possible: thirty CGA 
runs were executed for each test function using the pa- 
rameters shown in Table I. To compute the nondomi- 
nated front for the CGA, we did the following. For each 
CGA run, we collected all the output objective vectors 
((/ij/ 2 )) corresponding to the individuals evaluated dur- 
ing the run. For each test function, the output objec- 
tive vectors from five randomly-selected runs were com- 
bined and a domination test removed all the dominated 
solutions. For the algorithm-to- algorithm coverage test 
(function C described above), we used the results from the 
thirty runs as follows. The nondominated set from each 
run was computed. Then the domination test was per- 
formed by pitting the nondominated set from algorithm 


Number of generations 

250 

Trial population size 

100 

Target objective vector population size 

100 

Crossover rate (both populations) 

0.8 

Mutation rate (both populations) 

0.01 


TABLE I 

COE VOLUTIONARY GA PARAMETERS. 


A , run i , against the nondominated set from algorithm B, 
run i. Statistics, in the form of boxplots (described be- 
low), were computed using the resulting thirty C values. 
Both C(A t B) and C(B, A) were computed as they may be 
different. 

V. Results 

As noted in the literature, comparing multiobjective 
optimization algorithms against each other can be diffi- 
cult. One would like an algorithm to minimize the dis- 
tance to the Pareto-optimal front and provide uniform 
coverage of the Pareto-optimal front for a wide range of 
values. Thus, comparisons become multiobjective opti- 
mization problems themselves: is an algorithm that finds 
a handful of Pareto-optimal solutions better than an al- 
gorithm that finds a wide, uniform distribution of near 
Pareto-optimal solutions? With this in mind we present 
the experimental results. 

Figures 3-8 show the results from the six test functions. 
On each figure the optimal Pareto front is drawn as a 
curve, data points for the eight comparison algorithms are 
shown in grav^ , and the data points from the CGA runs 
(COEV) are shown as black circles. 

In general, the results show that the CGA is a rela- 
tively strong performer: it always exceeds random search 
and has qualitatively good performance against strong al- 
gorithms such as SPEA and NSGA. On the first two test 
functions, the CGA has the qualitatively best distribu- 
tion and alignment to the Pareto-optimal curve. In the 
third test function, it performs on par with SPEA. In the 
fourth test function, a multimodal surface, the CGA has 
poor coverage, yet find a solution that dominates nearly all 
the others. On the deceptive test function, 7^, the CGA 
provides relatively excellent coverage except at low /1 val- 
ues, with SOEA doing better there. On the nonuniform 
test function, 76, SPEA is the only algorithm to any find 
Pareto-optimal solutions, and is able to span the width of 
the front. However CGA provides near-optimal solutions, 
with good coverage at high values of / 1 . 

The head-to-head algorithm comparisons using the C 
metric are shown in the boxplots of Figure 9. Each boxplot 

2 Data used to plot the curves from the non-coevolutionary runs 
was obtained from the authors of [16]. Differences between the 
curves in this paper and in [16] can be seen because the data was 
sampled randomly. 
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Fig. 3. Test function 71 (convex). 



Fig. 5. Test function 75 (discrete). 
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Fig. 4. Test function 75 (nonconvex). 


Fig. 6. Test function 74 (multimodal). 


contains results from each of the six test functions: the 
dark dash is the median, the the top of the box is the 
upper quartile, the bottom of the box is the lower quartile. 
As can be seen, on all test functions the solutions found by 
the CGA statistically cover the solutions found by RAND, 
FFGA, NPGA, HLGA, and VEGA. The CGA’s weakest 
results are on Te against NSGA, SOEA, SPEA. 

VI. Conclusion 

Multiobjective optimization is clearly one of the most 
important class of problems in science and engineering. 
Solution techniques that are effective at searching what 
are typically vast search spaces, and finding a selection of 
Pareto-optimal solutions are very desirable. In this paper 
we presented a coevolutionary genetic algorithm inspired 
by development learning theory, and compared it empir- 
ically to seven other evolutionary search techniques for 
multiobjective optimization. In terms of algorithm de- 


sign, CGA is no more difficult to design and implement 
than a typical genetic algorithm. In fact, because the fit- 
ness functions are identical across application domains, 
implementation may be viewed as being easier. The re- 
sults show that the CGA performed very well compared to 
the other evolutionary algorithms and random search. On 
four of the six functions, it could be argued that the CGA 
qualitatively performed on par with or outperformed the 
other algorithms. Missing from this study is a comparison 
against traditional optimizing algorithms, which we leave 
for future work. 
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Fig. 7. Test function 7s (deceptive). 



Fig. 8. Test function T& (nonuniform). 
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